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Abstract. 

The FAdo system is a symbolic manipulator of formal languages objects, implemented in 
Python. In this work, we extend its capabilities by implementing methods to manipulate trans¬ 
ducers and we go one level higher than existing formal language systems and implement methods 
to manipulate objects representing classes of independent languages (widely known as code prop¬ 
erties). Our methods allow users to define their own code properties and combine them between 
themselves or with hxed properties such as prefix codes, suffix codes, error detecting codes, etc. 
The satisfaction and maximality decision questions are solvable for any of the definable proper¬ 
ties. The new online system LaSer allows to query about code properties and obtain the answer 
in a batch mode. Our work is founded on independence theory as well as the theory of rational 
relations and transducers and contributes with improveded algorithms on these objects. 
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1 Introduction 

Several programming platforms are nowadays available, providing methods to transform and ma¬ 
nipulate various formal language objects: Grail/Grail-|- [26,33], Vaucanson 2 [6,34], FAdo [2,10], 
OpenFST [19], JFLAP [19]. Some of these systems allow one to manipulate such objects within 
simple script environments. Grail for example, one of the oldest systems, provides a set of fil¬ 
ters manipulating automata and regular expressions on a UNIX command shell. Similarly, FAdo 
provides a set of methods manipulating such objects on a Python shell [25]. 

Software environments for symbolic manipulation of formal languages are widely recognized as 
important tools for theoretical and practical research. They allow easy prototyping of new algo¬ 
rithms, testing algorithm performance with large datasets, corroborate or disprove descriptional 
complexity bounds for manipulations of formal systems representations, etc. A typical example 
is, for a given operation on regular languages, to find an upper bound for the number states of a 
minimal deterministic finite automata (DFA) for the language that results from the operation, as 
a function of the number of states of the minimal DFAs of the operands. Due to the combinatorial 
nature of formal languages representations this kind of calculations are almost impossible without 
computational add. 

In this work, we extend the capabilities of FAdo and LaSer [9,17] by implementing transducer 
methods and by going to the higher level of implementing objects representing classes of indepen¬ 
dent formal languages, also known as code properties. More specifically, the contributions of the 
present paper are as follows. 

1. Implementation of transducer objects and state of the art transducer methods. FAdo is 
a regularly maintained and user friendly Python package that was lacking transducer ob¬ 
jects. Now available are general transducers as well as transducers in standard and normal 
forms. Some important methods that have been implemented are various product construc¬ 
tions between transducers and between transducers and automata, as well as a transducer 
functionality test. 
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2. Definitions of objects representing code properties and methods for their manipulation, which 
to our knowledge is a new development in software related to formal language objects. In 
simple mathematical terms, a code property is a class of languages that is closed for maximal 
languages. In addition to some fixed known code properties (such as prefix code, suffix code, 
hypercode), these methods can be used to construct new code properties, including various 
error-detecting properties, which are specified either via a trajectory regular expression [7] or 
via a transducer description [9]. Moreover, our methods can be used to combine any defined 
properties and produce a new property that is the conjunction of the defined properties. 

3. Enhancement and implementation of state of the art decision algorithms for code proper¬ 
ties of regular languages. In particular, many such algorithms have been implemented and 
enhanced so as to provide witnesses (counterexamples) in case of a negative answer, for ex¬ 
ample, when the given regular language does not satisfy the property, or is not maximal with 
respect to the property. To our knowledge such implementations are not openly available. In 
particular, the satisfaction of the classic property of unique decipherability, or decodability, 
is implemented for any given NFA based on the algorithm in [12] as well as the satisfaction 
of various error-detecting properties [16]. 

4. A mathematical definition of what it means to simulate (and hence implement) a hierarchy 
of properties and the proof that there is no complete simulation of the set of error-detecting 
properties. 

5. Generation of executable Python code based on the requested question about a given code 
property. This is mostly of use in the online LaSer [17], which receives client requests and 
attempts to compute answers. However, as the algorithm required to compute an answer 
can take a long time, LaSer provides the option to compute and return a self-contained 
executable Python program that can be executed at the client’s machine and return the 
required answer. 

6. All the above classes and methods are open source (GPL): available for anyone to copy, use 
and modify to suit their own application. 

Our work is founded on independence theory [15,32] as well as the theory of rational relations and 
transducers [4,28] . We present our algorithmic enhancements in a detailed mathematical manner 
with two aims in mind: first to establish the correctness of the enhanced algorithms and, second 
to allow interested readers to obtain a deeper understanding of these algorithms, which could 
potentially lead to further developments. 

The paper is organized as follows. 

Section 2 contains some basic terminology and background about various formal language con¬ 
cepts as well as a few examples of manipulating FAdo automata in Python. 

Section 3 describes our implementation of transducer object classes and a few basic methods 
involving product constructions and rational operations. 

Section 4 describes the decision algorithm for transducer functionality, and then our enhance¬ 
ment so as to provide witnesses when the transducer in question is not functional. 

Section 5 describes our implementation of code property objects and basic methods for their 
manipulation. Moreover, a mathematical approach to defining syntactic simulations of infi¬ 
nite sets of properties is presented, explaining that a linear simulation of all error-detecting 
properties exists, but no complete simulation of these properties is possible. 

Section 6 continues on code property methods by describing our implementation of the satis¬ 
faction and maximality methods. Again, we describe our enhancements so as to provide 
witnesses when the answer to the satisfaction/maximality question is negative. 
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Section 7 describes the implementation of the unique decodability (or decipherability) property 
and its satisfaction and maximality algorithms. This property is presented separately, as it 
is a classic one that cannot be defined within the methods of transducer properties. 

Section 8 describes the new version of LaSer [17] including the capability to generate executable 
programming code in response to a client’s request for the satisfaction or maximality of a 
given code property. 

Section 9 contains a few concluding remarks including directions for future research. 


2 Terminology and Background 

Sets, alphabets, words, languages. We write N,No,Z,IR for the sets of natural numbers (not 
including 0), non-negative integers, integers, and real numbers, respectively. If 5" is a set, then [S'] 
denotes the cardinality of S, and 2'^ denotes the set of all subsets of S. An alphabet is a finite 
nonempty set of symbols. In this paper, we write E, A for any arbitrary alphabets. If g G N, then 
Eg denotes the alphabet {0,1,..., g — 1}. The set of all words, or strings, over an alphabet E is 
written as E*, which includes the empty word e. A language (over E) is any set of words. In the 
rest of this paragraph, we use the following arbitrary object names: i, j for nonnegative integers, 

K, L for languages and u,v,w,x,y for words. If w G L then we say that w is an L-word. When 
there is no risk of confusion, we write a singleton language {w} simply as w. For example, LU w 
and uUw mean L\j{w} and {u}U{ry}, respectively. We use standard operations and notation on 
words and languages [13,21,27,36]. For example, uv, KL, L*, L*, L+ represent respectively, 
the concatenation of u and v, the word consisting of i copies of w, the concatenation of K and L, 
the language consisting of all words obtained by concatenating any i L-words, the Kleene star of 

L, and = L* \s. If w is of the form uv then u is a prefix and u is a suffix of w. If w is of the 
form uxv then x is an infix of w. If u w then u is called a proper prefix of w —the definitions 
of proper suffix and proper infix are similar. 

Codes, properties, independent languages, maximality. A property (over E) is any set V 
of languages. If L is in 7^ then we say that L satisfies V. Let Hq denote the cardinality of N. A 
code property, or independence, [15], is a property V for which there is n G N U {Hg} such that 

L G V, if and only if L' G V, for all L' C L with 0 < [L'j < n, 

that is, L satisfies the property exactly when all nonempty subsets of L with less than n elements 
satisfy the property. In this case, we also say that V is an n-independence. In the rest of the paper 
we only consider properties V that are code properties. A language L G V is called V-maximal, 
or a maximal V code, if LUw ^ V for any word w ^ L. From [15] we have that every L satisfying 
V is included in a maximal V code. To our knowledge, all known code related properties in the 
literature [5,7,9,11,15,24,31,38] are code properties as defined above. For example, consider the 
‘prefix code’ property: L is a prefix code if no word in L is a proper prefix of a word in L. This is 
a code property with n = 3. Indeed, let V' be the set of all singleton languages union with the set 
of all languages {u, u} such that u is not a proper prefix of v and vice versa. Then, every element 
in V' is a prefix code. Moreover any L is a prefix code if and only if any nonempty subset C 
of L with less than three elements is in V'. As we shall see further below the focus of this work 
is on 3-independence properties that can also be viewed as independent with respect to a binary 
relation in the sense of [32]. 

Automata and regular languages [28, 37]. A nondeterministic finite automaton with empty 
transitions, for short automaton or e-NFA, is a quintuple 

Si={Q,^,T,I,F) 

such that Q is the set of states, E is an alphabet, I,F C Q are the sets of start (or initial) 
states and final states, respectively, and T C Q x (E U e) x Q is the finite set of transitions. Let 


3 


(p, X, q) be a transition of a. Then x is called the label of the transition, and we say that p has an 
outgoing transition (with label x). A path of a is a finite sequence (poi xi,pi, ..., X(,p(,), for some 
nonnegative integer £, such that each triple {pi-i,Xi,pi) is a transition of a. The word xi ■ ■ ■ Xi is 
called the label of the path. The path is called accepting if po is a start state and pi is a final state. 
The language accepted by a, denoted as L(a), is the set of labels of all the accepting paths of a. 
The £-NFA a is called trim, if every state appears in some accepting path of a. The automaton a 
is called an NFA, if no transition label is empty, that is, T C Q x S x Q. A deterministic finite 
automaton, or DFA for short, is a special type of NFA where / is a singleton set and there is 
no state p having two outgoing transitions with equal labels. The size |a| of the automaton a is 
IQI + \T\. The automaton results when we add e-loops in a, that is, transitions {p,s,p) for all 
states p G Q. 

Transducers and (word) relations [4, 28, 37]. A (word) relation over S and A is a subset of 
S* X A*, that is, a set of pairs {x, y) of words over the two alphabets (respectively). The inverse of 
a relation p, denoted as is the relation {(]j,x) \ (x,y) G p}. A (finite) transducer is a sextuple 

t = (Q,E,A,T,/,F) 

such that Q, I, F are exactly the same as those in e-NFAs, S is now called the input alphabet, A 
is the output alphabet, and T C Q x E* x A* x Q is the finite set of transitions. We write {p,x/y, q) 
for a transition - the label here is (x/y), with x being the input and y being the output label. 
The concepts of path, accepting path, and trim transducer are similar to those in e-NFAs. In 
particular the label of a path (p[),xi/yi,pi, ..., x^/yi,pi) is the pair {xi ■ ■ • xi, yi • • • yi) consisting 
of the input and output labels in the path. The relation realized by the transducer t, denoted as 
R(t), is the set of labels in all the accepting paths of t. We write t(a:) for the set of possible outputs 
of t on input x, that is, y € t(a;) iff {x,y) G R(t). The domain of t is the set of all words w such 
that t{w) ^ 0. The inverse of a transducer t, denoted as t“^, is the transducer that results from t 
by simply switching the input and output alphabets of t and also switching the input and output 
parts of the labels in the transitions of t. It follows that realizes the inverse of the relation 
realized by t. The transducer t is said to be in standard form, if each transition (p, x/y, q) is such 
that X G (E U e) and y € (A U e). It is in normal form if it is in standard form and exactly one of 
X and y is equal to e. We note that every transducer is effectively equivalent to one (realizing the 
same relation, that is) in standard form and one in normal form. As in the case of automata, the 
transducer results when we add e-loops in t, that is, transitions {p,e/e,p) for all states p G Q. 
The size of a transition {p,x/y,q) is the number 1 -|- |x| -I- |y|. The size |t| of the transducer t is 
the sum of the number of states and sizes of transitions in T. If s and t are transducers, then 
there is a transducer s V t of size 0(|s| -|- |t|) realizing R(s) U R(t). 

Automata and finite languages in FAdo [10]. The modules fa for automata, fl for finite 
languages, and fio for input/output of formal language objects, can be imported in a standard 
Python manner as follows. 

import FAdo.fl as fl 
import FAdo.fio as fio 

from FAdo.fa import * # import all fa methods for readability 

The FAdo object classes FL, DFA and NFA manipulate finite languages, DFAs and e-NFAs, respec¬ 
tively. 

Example 1. The following code builds a finite language object L from a list of strings, and then 
builds an NFA object a accepting the language {a, ab, aab}. 

1st = [’a’, ’ab’, ’aab’] 

L = fl.FL(lst) 
a = L.toNFAO 

The second line uses the class name FL to create the finite language L from the given list of strings. 
The last line returns an NFA accepting the language L. □ 
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The following code reads an automaton, or transducer, a from a file, and then writes it into a file, 
a = fio.readOneFromFile(filenamiel) 

fio.saveToFile(filename2, a) 

These methods assume that automata and transducers are written into those files in FAdo format— 
see examples below and further ones in [10,17]. We can also read an automaton or transducer 
from a string s using the function readOneFromString(s). 

Example 2. The following code defines a string that contains an automaton accepting a*b and 
then uses that string to define an NFA object. 

St = ’@NFA 1 * 0\n0 a 0\n0 b l\n’ 
a = fio.readDneFromString(st) 

As usual, the pattern \n denotes the end of line character, so the string st consists of three lines: 
the first indicates the type of object followed by the final states (in this case 1) and the start states 
after * (in this case 0); the second line contains the transition 0 a 0; and the third line contains 
the transition Obi. The string has to end with a \n. □ 

Next we list a few useful methods for NFA objects. We assume that a and b are automata, and w 
is a string (a word). 

a. evalWordP(w) : returns True or False, depending on whether the automaton a accepts w. 
a.emptyPO: returns True or False, depending on whether the language accepted by a is empty, 
a & b: returns an NFA accepting L(a) n L(b). Both a and b must be NFAs with no e-transitions, 
a. elimEpsilonO : alters a to an equivalent NFA with no e-transitions, 
a.epsilonPO : returns True or False, depending on whether a has any e-transitions. 
a.makePNG(fileName=’xyz’ ): creates a file xyz.jpg containing a picture of the automaton (or 
transducer) a. 

a. enumNFA(n) : returns the set of words of length up to n that are accepted by the automaton a. 

Example 3. The following example shows a naive implementation of a. evalWordP(w) 

b = fl.FL( [w] ) .toNFAO 
c = a & b 

return not c.emptyPO 

One verifies that w is accepted by a if and only if a intersected with the automaton accepting {w} 
accepts something. □ 

3 Transducer Object Classes and Methods 

In this section we discuss some aspects of the implementation of transducer objects and then 
continue with several important methods, which we divide into two subsections: product con¬ 
structions and rational operations. We discuss the method for testing functionality in Section 4. 
The module containing all that is discussed in this and the next section is called transducers .py. 
We can import all transducer methods as follows. 

from FAdo.transducers import * 

Transducer objects and basic methods 
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From a mathematical point of view, a Python dictionary is an onto function delta : D ^ R 
between two finite sets of values D and R. One writes delta [x] for the image of delta on x. 
One can define completely the dictionary using an expression of the form 

delta = {dl:rl, d2:r2, ... , dN;rN} 

which defines delta [df] = ri, for all f = 1,..., N. In FAdo, the class GFT, for General Form 
Transducer, is a subclass of NFA. A transducer t = (Q, E, A, T, I, F) is implemented as an object 
t with six instance variables 

States, Sigma, Output, delta. Initial, Final 

corresponding to the six components of t. Specifically, States is a list of unique state names, 
meaning that each state name has an index which is the position of the state name in the list, 
with 0 being the first index value. The variables Sigma, Output, Initial and Final are sets, 
where the latter two are sets of state indexes. For efficiency reasons, the set of transitions T is 
implemented as a dictionary 

delta; {0, ..., n - 1} ^ (Sigma ^ 

where n is the number of states. Thus, for any p S {0,...,n — 1}, delta [p] is a dictionary, 
and for any input label a;, delta [p] [a;] is a set of pairs {y,q) corresponding to all transitions 
{{p,x/y,q) G T I y S Output, q is a state index}. 

Standard form transducers are objects of the FAdo class SFT, which is a subclass of GFT. The class 
SFT is very important from an algorithmic point of view, as most product constructions require a 
transducer to be in standard form. The conversion from GFT to SFT is done using 

s = t.toSFTO 

which assigns to s a new SFT object equivalent to t. The implementation of Normal Form Trans¬ 
ducers is via the FAdo class NFT. This form of transducers is convenient in proving mathematical 
statements about transducers [28]. 

Example 4. The following code defines a string s that contains a transducer description and 
then constructs an SFT transducer from that string. The transducer, on input cc, returns the set 
of all proper suffixes of x —see also Fig 1. It has an initial state 0 and a final state 1, and deletes 
at least one of the input symbols so that what gets outputted at state 1 is a proper suffix of the 
input word. 

s = ’OTransducer 1 * 0\n’\ 

’0 a Oepsilon 0\n’\ 

’0 b Oepsilon 0\n’\ 

’0 a Oepsilon l\n’\ 

’0 b Oepsilon l\n’\ 

’1 a a l\n’\ 

G b b l\n’ 

t = fio.readOneFromString(s) 

□ 


Recall, for a transducer t and word ic, t(w) is the set of possible outputs of t on input w. Note 
that this set could be empty, finite, or even infinite. In any case, it is always a regular language. 
The FAdo method t. runOnWord(w) assumes that t is an SFT object and returns an automaton 
accepting the language t(w). 

Example 5. The following code is a continuation of Example 4. It prints the set of all proper 
suffixes of the word ababb. 


a = t.runOnWord( ’ ababb’) 
n = len(’ababb’) 
print a.enmnNFA(n) 
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Figure 1: On input x the transducer shown in the hgure outputs any proper suffix 
of X. Note: In this and the following transducer figures, the input and output 
alphabets are equal. An arrow with label o ja represents a set of transitions with 
labels cr/cr, for all alphabet symbols tr; and similarly for an arrow with label cr/e. 
An arrow with label cr/tr' represents a set of transitions with labels ojo' for all 
distinct alphabet symbols a,a'. 


□ 

Assuming again that t is an SFT object, we have the following methods, 
t. inverse 0: returns the inverse of the transducer t. 

t. evalWordP( (w,w’ )): returns True or False, depending whether the pair (w,w’) belongs to 
the relation realized by t. 

t .nonEmptyWO : returns some word pair (u, v) which belongs to the relation realized by t, if 
nonempty. Else, it returns the pair (None, None). 

t.toInNFAO: returns the NFA that results if we remove the output alphabet and the output 
labels of the transitions in t. 

t. toOutNFAO : returns the NFA that results if we remove the input alphabet and the input labels 
of the transitions in t. 

Product constructions [4, 16, 37] 

The following methods are available in FAdo. They are adaptations of the standard product 
construction [13] between two NFAs which produces an NFA with transitions ((pi,P 2 ), cr, (qi, 92 )), 
where (pi,CT, gi) and (p 2 ,cr, 52 ) are transitions of the two NFAs, such that the new NFA accepts 
the intersection of the corresponding languages. We assume that t and s are SFT objects and a 
is an NFA object. 

t. inlntersection(a) : returns a transducer realizing all word pairs (x, y) such that x is accepted 
by a and {x,y) is realized by t. 

t. outlntersection(a) : returns a transducer realizing all word pairs {x, y) such that y is accepted 
by a and {x,y) is realized by t. 

t. runOnNFA(a) : returns the automaton accepting the language 

U 

ii;GL(a) 

t. composition(s) : returns a transducer realizing the composition R(t) o R(s) of the relations 
realized by the two transducers. 

To make the presentation a little more concrete for interested readers, we comment on one of 
the above methods, t . runOnNFA(a) . The construction first considers whether t has any e-input 
transitions, that is transitions with labels e/y. If yes, then a copy b of a is made with e-loops 
added, that is, transitions {p,e,p) for all states p in a. Then, if a has any e-transitions then a 
copy s of t is made with e-loops added, that is, transitions {q,e/e,q) for all states q in t. Then, 
the actual product construction is carried out: if (p,x,p') and {q,x/y,q') are transitions of b and 
s, respectively, then ((p, q), y, (p', q')) is a transition of the resulting automaton. 
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Example 6. Continuing Examples 2 and 4, the following code constructs an NFA object b ac¬ 
cepting every word that is a proper suffix of some word in a*b. It then enters a loop that prints 
whether a given string ic is a suffix of some word in a*b. 

b = t.runDnNFA(a) 
while True; 

w = raw_input 0 

if ’a’ not in w auid ’b' not in w: break 
print b.evalWordP(w) 


□ 

Rational operations [4^] 

A relation p is a rational relation, if it is equal to 0, or {{x, j/)} for some words x and y, or can be 
obtained from other ones by using a finite number of times any of the three (rational) operators: 
union, concatenation, Kleene star. A classic result on transducers says that a relation is rational if 
and only if it can be realized by a transducer. The following methods are now available in FAdo, 
where we assume that s and t are SFT transducers. 

t.union(s): returns a transducer realizing the union of the relations realized by s and t. 

t.concat(s): returns a transducer realizing the concatenation of the relations realized by s 
and t. 

t. star (f lag=False) : returns a transducer realizing the Kleene star of the relation realized by 
t, assuming the argument is missing or is False. Else it returns a transducer realizing (R(t))+. 

The implementation of the above methods mimics the implementation of the corresponding meth¬ 
ods on automata. 

4 Witness of Transducer non-functionality 

A transducer t is called functional if, for every word w, the set t(r(;) is either empty or a singleton. 
A triple of words (w,z,z') is called a witness oft’s non-functionality, ii z ^ z' and z,z' G t{w). 
In this section we present the SFT method t .nouFunctionalWO, which returns a witness of t’s 
non-functionality, or the triple (None, None, None) if t is functional. The method is an adaptation 
of the decision algorithms in [1,3] that return whether a given transducer in standard form is func¬ 
tional. Although there are some differences in the two algorithms, we believe that conceptually the 
algorithmic technique is the same. We first describe that algorithmic technique following the pre¬ 
sentation in [3], and then we modify it in order to produce the method t .nonFunctionalWO. We 
also note that, using a careful implementation and assuming fixed alphabets, the time complexity 
of the decision algorithm can be quadratic with respect to the size of the transducer—see [1] . 
Given a standard form transducer t = {Q, S, A, T, I, F), the first phase is to construct the square 
machine u, which is defined by the following process. 

Phase 1 

1. First define an automaton u' as follows: states Q x Q, initial states I x I, and final states 
F X F. 

2. If t contains e-input transitions, that is, transitions with labels of the form e/u then we let 
t be t®. The transitions of u' are all the triples 

{{P,P'), (x,x'), (q,q')) 

such that (jp,v/x,q) and {p' ,v/x' ,q') are transitions of t. 

3. Return u = a trim version of u'. 


Note that any accepting path of u has a label {xi (xn, x'„) such that the words xi ■ ■ ■ Xn 

and x[ ■ ■ ■ x'^ are outputs (possibly equal) of t on the same input word. The next phase is to 
perform a process that starts from the initial states and assigns a delay value to each state, which 
is either ZERO or a pair of words in {(e, e), (e, u), (u, e)}, with u being nonempty. A delay (y, y') 
on a state {p,p') indicates that there is a path in u from I x I to {p,p') whose label is a word pair 
of the form {fy,fy')- This means that there is an input word that can take the transducer t to 
state p with output fy and also to state p' with output fy'. A delay ZERO at {p,p') means that 
there is an input word that can take t to state p with output of the form fag and to state p' with 
output of the form fa'g', where a and a' are distinct alphabet symbols. 

Phase 2 

4. Assign to each initial state the delay value {e,e). 

5. Starting from the initial states, visit all transitions in breadth-first search mode such that, 
if {p,p') has the delay value (y, y') and a transition ((p,p'), (x, x'), (y, q')) is visited, then the 
state (y, q') gets a delay value D as follows: 

• If y'x' is of the form yxu then D = (e, u). If yx is of the form y'x'u then D = {u, e). If 
y'x' = yx then D = (e,e). Else, D = ZERO. 

6 . The above process stops when a delay value is ZERO, or a state gets two different delay 
values, or every state gets one delay value. 

7. If every state has one delay value and every final state has the delay value (e, e) then return 
True (the transducer is functional). Else, return False. 

Next we present our witness version of the transducer functionality algorithm. First, the square 
machine u is revised such that its transitions are of the form 


{{P-,p’)Av,x,x’),{q,q')), 


that is, we now record in u information about the common input v (see Step 2 in Phase I). Then, 
to each state (y, q') we assign not only a delay value but also a path value {a, /3, /3') which means 
that, on input a, the transducer t can reach state y with output /3 and also state y' with output 
/?'—see Fig. 2. 



Figure 2: The figure shows the structure of the (revised) square machine u corre¬ 
sponding to the given transducer t. The delay and path values for the states of u 
are explained in Definition 7 
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Definition 7. Let {q, q') be a state of the new square machine u. The set of delay-path values of 
{q,q') is defined as follows. 

• If {q,q') is an initial state then ((e,e), {e,e,e)) is a delay-path value of {q,q'). 

• If {v, X, x'), {q, q')) is a transition in u and (jp,p') has a delay-path value (C, (w, z, z')), 

then {D, {wv, zx, z'x')) is a delay-path value of {q, q'), where D is defined as follows. 

1. li C = {y, y') ^ ZERO and y'x' is of the form yxu then D = (e, u). 

2. li C = {y,y') ^ ZERO and yx is of the form y'x'u then D = {u,e). 

3. lie = (y, y') 7 ^ ZERO and y'x' = yx then D = (e,e) 

4. Else, D = ZERO. 

For (y, g'), we also define a suffix triple {wqq/, Zqq', z'^^,) to be the label of any path from [q, q') to 
a final state of u. 

Remark 8. The above definition implies that if a state (p,p') has a delay-path value (C, (w, z, z')), 
then there is a path in u whose label is {w, z, z'). Moreover, by the definition of u, the transducer 
t on input w can reach state p with output z and also state p' with output z'. Thus, if {p,p') is a 
final state, then z, z' G t{w). 

Algorithm nonFunctionalW 

1. Define function completePath(q, q') that follows a shortest path from (y, q') to a final state 
of u and returns a suffix triple (see Definition 7). 

2. Construct the revised square machine u, as in Phase 1 above but now use transitions of the 
form 

{{p,p')A'>^^x,x'),{q,q')), 

(see step 2 in Phase 1). 

3. Assign to each initial state the delay-path value ((e, e), (e, £,e)). 

4. Starting from the initial states, visit all transitions in breadth-first search mode. If {p,p') has 
delay-path value ((y, y'), (w, z, z')), and a transition ((p,p'), (v, x, x'), (y, q')) is visited, then 
compute the delay value D of (y, y') as in steps 1-4 of Definition 7, and let R = {wv, zx, z'x'). 
Then, 

(a) if D is ZERO, then invoke completePath (y, y') to get a suffix triple {wqq>, Zqq>, z'^^,) 
and return {wvwqqi, zxzqq>, z'x'z'^^,). 

(b) if (y,y') is final and D ^ (£,£), return {wv, zx, z'x'). 

(c) if (y,y') already has a delay value ^ D and, hence, a path value P = (wi,zi,z(), then 
invoke completePath (y, y') to get a suffix triple {wqqi, Zqq>, z'^^,). Then, 

• If ZXZqqi 7^ z'x'z'^^, retum {wVWqq', ZXZqqi , z'x'z'^^,). 

• Else return {wiWqqi, ZlZqq' , z'^yz'qq,). 

(d) else assign {D,R) to (y,y') as delay-path value and continue the breadth-first process. 

5. At this point return (None,None,None), as the breadth-first process has been completed. 

Terminology. Let A = {wi,..., Wk) be a tuple consisting of words. The size \A\ of A is the number 
J2i=i -P {k—1). For example, |(0, 01,10)| = 7. If {Ai} is any set of word tuples then a minimal 
element (of that set) is any Ai whose size is the minimum of {|Ai|}. 

Theorem 9. If algorithm nonFunctionalW is given as input a standard form transducer t, then 
it returns either a size 0(|tp) witness oft’s non-functionality, or the triple (None,None,None) if 
t is functional. 
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Before we proceed with the proof of the above result, we note that there is a sequence (t^) of 
non-functional transducers such that jt^j —>■ oo and any minimal witness of t^’s non-functionality 
is of size 0(|tip). Indeed, let (pi) be the sequence of primes in increasing order and consider the 
transducer shown in Fig. 3. It has size Q{pi) and every output word w of has length equal 
to that of the input used to get w. The relation realized by is 

Any minimal witness of t^’s non-functionality is of the form Wi = such 

that mpi = n{pi + I). Using standard facts from number theory, we have that n > pi. Hence, 
|wi| > 2 -I- 3 X piipi + I), that is, |wi| = 0(|tjp). 



Figure 3: Transducers with quadratic size minimal witnesses of non-functionality. 


The following lemma is useful for establishing the correctness of the algorithm nonFunctionalW. 

Lemma 10. If a state {q,q') has a delay-path value ((s, s'), {a,/3,fi')) then there is a word h such 
that fd = hs and /3' = hs'. 

Proof. We use induction based on Definition 7. If the given delay-path value is ((e, e), (e, e, £)) 
the statement is true. Now suppose that there is a transition {(p,p'), (v,x,x'), (q,q')) such that 
the statement is true for state (p,p') (induction hypothesis) and ((s, s'), {a, 13,13')) results from a 
delay-path value (C, {w,z,z')) of {p,p'). As (s, s') ^ ZERO then also C ZERO, so C is of the 
form {y,y') and one of the three cases 1-3 of Definition 7 applies. Moreover, by the induction 
hypothesis on (p,p') we have z = gy and z' = gy', for some word g, hence, (3 = gyx and (3' = gy'x'. 
Now we consider the three cases. If y'x' = yxu then (s,s') = {e,u). Also, for h = gyx we have 
(3 = hs and /?' = hs', as required. If yx = y'x'u then (s, s') = {u, e) and one works analogously. If 
yx = y'x' then (s, s') = {e, e). Also, (3 = (3' and the statement follows using h = (3. □ 


Proof, (of Prop. 9) First note that the algorithm returns a triple other than (None,None .None) 
exactly the first time when one of the following occurs (i) a ZERO value for D is computed, or 
(ii) a value of D other than (e, e) is computed for a final state, or (iii) a value of D, other than 
the existing delay value, of a visited state is computed. Thus, the algorithm assigns at most one 
delay value to each state {q, q'). If the algorithm assigns exactly one delay value to each state and 
terminates at step 5, then its execution is essentially the same as that of the decision version of the 
algorithm, except for the fact that in the decision version no path values are computed. Hence, 
in this case the transducer is functional and the algorithm correctly returns (None,None,None) in 
step 5. 

In the sequel we assume that the algorithm terminates in one of the three subcases (a)—(c) 
of step 4. So let {q,q') be a state at which the algorithm computes some delay value D and path 
value R = {a, (3, (3 ')—see step 4. It is sufficient to show the following statements. 
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51 If D is ZERO then {awqq’, j3zqqi, j3' is a witness of t’s non-functionality. 

52 If {q,q') is final and D G {(e, u), (u,e)}, with u nonempty, then {a, (3,13') is a witness of t’s 

non-functionality. 

53 If D is of the form (s,s') and ((si,Sj), (oi,/3i,/3j)) is the existing delay-path value of {q,q') 

with (si, s'j^) ^ (s, s'), then one of the following triples is a witness of t’s non-functionality 

{aWqq>,PZqq',f3'z'qq,), {aiWgq', f3l 

^qq' j Pl^qq')- 


For statement SI, by Remark 8 , it suffices to show that (3zqq' ^ /3'z'^g,. First note that D is ZERO 
exactly when there is a transition ((p,p'), {v,x,x'), {q,q')) such that state {p,p') has a delay-path 
value {{y,y'),(w,z,z')) and yx,y'x' are of the form fag and fa'g', respectively, with a, a' being 
distinct letters, and a = wv, (3 = zx, (3' = z'x'. Now using the above lemma we get, for some h 


fiZqq! = zxZqq! = hyxZqq> = hfagZqqi and 
jS'z'qq, = z'x'z'gg, = hy'x'z'^g, = hfa'g'z'gg,, 
which implies I3zqqi P'z'gg, , as required. 

For statement S2, by Remark 8, it suffices to show that j3 j3'. By symmetry, we only con¬ 
sider the case oi D = {e,u). First note that D is {e,u) exactly when there is a transition 
{{p,p'), {v, X, x'), (g, q')) such that state (p,p') has a delay-path value {{y, y'), {w, z, z')) and y'x' = 
yxu, and a = wv, (3 = zx, j5' = z'x'. Using the above lemma we get, for some h 
j3 = zx = hyx and (3' = z'x' = hy'x' = hyxu, 
which implies (3 j3', as required. 


For statement S3, we assume that Pzqq' = f3'z'^g, and we show that /3iZqqi 7 ^ l^iz'qq,- Assume for 
the sake of contradiction that also PiZqq/ = jSiz'gg,. Using the above lemma we get, for some h 

f3 = hs, j3' = hs', /3i = hisi, f3'i = his'i- 

Also by the assumptions we get hszqqi = hs'z'gg, and hiSiZqq> = his'iz'gg,, implying that szqq' = 
s'z'gg, and siZqq> = s'lz'gg,. If Zqq' = z'gg, then s = s' = e and si = which is impossible as 

{s,s') 7 ^ (si,Si). If Zqq' is of the form ziz'gg, (or vice versa), then we get that (s, s') = (si,si), 
which is again impossible. 


Regarding the size of the witness returned, consider again statements SI-S3 above. Then, the 
size of the witness is \{x,y,z) \ + \ {wqqr, Zqq>, z'gg,) \ — 2 , where (wqqi, Zqq>, z'gg,) could be {e,e,e) and 
{x,y,z) is a path value of state {q,q')'. {a, 13,$') or (ai,/3i,/3(). As {wggi, Zgg>, z'gg,) is based on a 
shortest path from {q,q') to a final state of u, we have \{wgg', Zggi, z'gg,)\ < |u|. As the algorithm 
visits each transition of u at most once, and {x, y, z) is built by concatenating transition labels 
starting from label (e,e,£), we have that the size of {x,y,z) is bounded by the sum of the sizes 
of the transitions of u. Hence, the size of the witness is 0(|u|). The claim about the size of the 
witness follows by the fact that |u| = 0(|tp). □ 


5 Object Classes Representing Code Properties 

In this section we discuss our implementation of objects representing code properties. The set of all 
code properties is uncountable, but of course one can only implement countably many properties. 
So we are interested in systematic methods that allow one to formally describe code properties. 
Three such formal methods are the implicational conditions of [14] , where a property is described by 
a first order formula of a certain type, the regular trajectories of [7], where a property is described 
by a regular expression over {0,1}, and the transducers of [9], where a property is described by 
a transducer. These formal methods appear to be able to describe most properties of practical 
interest. The formal methods of regular trajectories and transducers are implemented here, as the 
transducer formal method follows naturally our implementation of transducers, and every regular 
expression of the regular trajectory formal method can be converted efficiently to a transducer 
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object of the transducer formal method. The implementation of implicational conditions is an 
interesting topic for future research. 

Next we review quickly the formal methods of regular trajectories and transducers, and then 
discuss our implementation of these formal methods. 

Regular trajectory properties [7]. In this formal method a regular expression e over {0,1} 
describes the code property Ve as follows. The Os in e indicate positions of alphabet symbols that 
make up a word u, say, and the Is in e indicate positions of arbitrary symbols that, together with 
the Os, make up a word u, say. A language L satisfies Vg if there are no two different words in 
u^v G L such that u has the structure indicated by e and v has a structure obtained by deleting 
the Is from e. For example, the infix code property is defined by the regular expression 1*0*1*, 
which says that by deleting consecutive symbols at the beginning and/or at the end of an L-word 
M, one cannot get a different L-word. Equivalently, L is an infix code if no L-word is an infix of 
another L-word. Note that 1*0*1* describes all infix codes over all possible alphabets. 

Input-altering transducer properties [9]. A transducer t is input-altering if, for all words w, 
w ^ t{w). In this formal method such a transducer t describes the code property consisting 
of all languages L over the input alphabet of t such that 

t(L)nL = 0. (1) 

With this formal method we can dehne the suffix code property: L is a suffix code if no L-word is 
a proper suffix of an L-word. The transducer defined in Example 4 is input-altering and describes 
the suffix code property over the alphabet {a, b}. Similarly, we can define the infix code property 
by making another transducer that, on input u;, returns any proper infix of w. We note that, for 
every regular expression e over {0,1} and alphabet S, one can construct in linear time an input- 
altering transducer t with input alphabet E such that Vg = Vf [9]. Thus, every regular trajectory 
property is an input-altering transducer property. 

Error-detecting properties via input-preserving transducers [9, 16]. A transducer t is 
input-preserving if, for all words w in the domain of R(t), w S t(w). Such a transducer t is 
also called a channel transducer, in the sense that an input message w can be transmitted via t 
and the output can always be w (no transmission error), or a word other than w (error). In this 
formal method the transducer t describes the error-detecting for t property V^‘^ consisting of all 
languages L over the input alphabet of t such that 

t{w) n (L — ru) = 0, for all words w G L. (2) 

The term error-detecting for t is used in the sense that L is meant to consist of all valid messages 
one can transmit via t, and t cannot turn a valid message into a different valid message. We 
note that, for every input-altering transducer t, one can make in linear time a channel transducer 
t' such that Vf = V^f [9]. Thus, every input-altering transducer property is an error-detecting 
property. 

Example 11. Consider the property 1-substitution error-detecting code over {a, b}, where error 
means the substitution of one symbol by another symbol. A classic characterization is that, L is 
such a code if and only if the Hamming distance between any two different words in L is at least 
2 [11]. The following channel transducer defines this property—see also Eig 4. The transducer 
will substitute at most one symbol of the input word with another symbol. 

si = ’STransducer 0 1 * 0\n’\ 

’0 a a 0\n’\ 

’0 b b 0\nA 
’0 b a l\n’\ 

’0 a b l\n’\ 

’1 a a l\n’\ 

’1 b b l\n’ 

tl = fio.readDneFromString(sl) 
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□ 

We note that the transducer approach to defining error-detecting code properties is very powerful, 
as it allows one to model insertion and deletion errors, in addition to substitution errors—see 
Fig 4. Codes for such errors are actively under investigation—see [24], for instance. 


a ja 


^Isub • 





Figure 4: On input x the transducer tisub outputs either x, or any word that results 
by substituting exactly one symbol in x. On input x the transducer tiM outputs 
either x, or any word that results by deleting, or inserting, exactly one symbol in 
X. Note: The use of labels on arrows is explained in Fig. 1. 


Error-correcting properties via input-preserving transducers. An input-preserving (or 
channel) transducer is used to describe the error-correcting for t property consisting of all 
languages L over the input alphabet of t such that 

t(u) n t{w) = 0, for all distinct words v,w G L. (3) 

The term error-correcting for t is used in the sense that any message w' received from an L-word 
via t can result from only one such L-word, so if w' ^ L then w' can in principle be corrected to 
exactly one L-word. 

Remark 12. It can be shown that a language is error-correcting for t if and only if it is error¬ 
detecting for any transducer realizing the relation R(t“^) o R(t). 

Remark 13. All input-altering, error-detecting and error-correcting properties are 3-independences. 

5.1 Implementation in FAdo. 

We present now our implementation of the previously mentioned code properties. We have defined 
the Python classes 

TrajProp, lATProp, ErrDetectProp, ErrCorrectProp 

corresponding to the four types of properties discussed above. These four property types are de¬ 
scribed, respectively, by regular trajectory expressions, input-altering transducers, input-preserving 
transducers, and input-preserving transducers. In all four cases, given a transducer object, an ob¬ 
ject of the class is created. An object p of the class lATProp, say, is defined via some transducer 
t and represents a particular code property, that is, the class of languages satisfying Eq. (1). 

The class ErrDetectProp is a superclass of the others. These classes and all related methods and 
functions are in the module codes.py and can be imported as follows. 

import FAdo.codes as codes 

Although each of the above four classes requires a transducer to create an object of the class, we 
have defined a set of what we call build functions as a user interface for creating code property 
objects. These build functions are shown next in use with specific arguments from previous 
examples. 

Example 14. Consider again Examples 4 and 11 in which the strings s and si are defined 
containing, respectively, the proper suffixes transducer and the transducer permitting up to 1 
substitution error. The following object definitions are possible with the FAdo package 
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icp = codes.buildTrajPropSC’1*0*1*’, {’a’, ’b'}) 

scp = codes.buildlATPropS(s) 

sldp = codes.buildErrorDetectPropS(si) 

slop = codes.buildErrorCorrectPropS(si) 

pep = codes.buildPrefixProperty({’a’, ’b’}) 

icp2 = codes.buildInfixProperty({’a’, ’b’}) 

In the first statement, icp represents the infix code property over the alphabet {a, b} and is 
defined via the trajectory expression 1*0*1*. In the next three statements, scp, sldp, slep 
represent, respectively, the suffix code property, the 1-substitution error-detecting property and 
the 1-substitution error-correcting property. The last two statements are explained below— pep 
and icp2 represent the prefix code and infix code properties, respectively. □ 

Fixed properties. Some properties are well known in the theory of codes, so we have created 
specific classes for these properties and, therefore, FAdo users need not write transducers, or 
trajectory regular expressions, for creating these properties. As before, users need only to know 
about the build-interfaces for creating objects of these classes. 

buildPrefixProperty(Sigma) : returns an object of the class PrefixProp that represents all 
prefix codes over the alphabet Sigma. 

buildSuffixProperty(Sigma) : returns an object of the class SuffixProp that represents all 
suffix codes over the alphabet Sigma. 

buildInfixProperty (Sigma) : returns an object of the class InfixProp that represents all infix 
codes over the alphabet Sigma. Note that an infix code is both prefix and suffix and this fact is 
reflected in the class definition, by considering InfixProp a Python subclass of both PrefixProp 
and SuffixProp. 

buildOutfixProperty(Sigma) : returns an object of the class OutfixProp that represents all 
outfix codes over the alphabet Sigma. A language L is an outfix code if deleting an infix of an 
L-word cannot result into another L-word. Note that an outfix code is both prefix and suffix, and, 
as above, OutfixProp is a subclass of the correspondent Python classes. 

buildHypercodeProperty (Sigma) : returns an object of the class HypercodeProp that represents 
all hypercodes over the alphabet Sigma. A language L is a hypercode if deleting any symbols of 
an L-word cannot result into another L-word. Note that a hypercode is both infix and outfix and 
as above, HypercodeProp is a subclass of the correspondent Python classes. 

Each of the above methods creates internally a transducer whose input and output alphabets 
are equal to the given alphabet Sigma, and then passes the transducer to the constructor of the 
respective class. 

Combining code properties. In practice it is desirable to be able to talk about languages that 
satisfy more than one code property. For example, most of the 1-substitution error-detecting codes 
used in practice are infix codes (in fact block codes, that is, those whose words are of the same 
length). We have defined the operation & between any two error-detecting properties independently 
of how they were created. This operation returns an object representing the class of all languages 
satisfying both properties. This object is constructed via the transducer that results by taking the 
union of the two transducers describing the two properties—see Rational Operations in Section 3. 

Example 15. Using the properties icp, sldp created above in Example 14, we can create the 
conjunction pi of these properties, and using the properties pep, scp we can create their con¬ 
junction bep which is known as the bifix code property. 

pi = icp & sldp 
bep = pep & scp 
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The object pi represents the property Ve H Vlf, where e = 1*0*1*. It is of type ErrDetectProp. 
If, however, the two properties involved are input-altering then our implementation makes sure 
that the object returned is also of type input-altering—this is the case for bcp. This is important 
as the satisfaction problem for input-altering transducer properties can be solved more efficiently 
than the satisfaction problem for error-detecting properties—see Section 6. □ 

5.2 Aspects of Code Hierarchy Implementation 

As stated above, our top Python superclass is ErrDetectProp. When viewed as a set of (potential) 
objects, this class implements the set of properties 

•ped _ j-ped | ^ jg input-preserving transducer}. (4) 

For any ErrDetectProp object p, let us denote by [p] the property in represented by p. If 
p and q are any ErrDetectProp objects such that [p] C [q] and we know that a language satisfies 
[p] then it follows logically that the language also satisfies [q] and, therefore, one does not need 
to execute the method q.notSatsf iesW on the automaton accepting that language. Similarly, as 
[p&q] = [p] the method invocation p&q should simply return p. It is therefore desirable to have a 
method ‘<’ such that if p < q returns true then [p] C [q]. 

In fact, for ErrDetectProp objects, we have implemented the methods and ‘<’ in a way that 
the triple (ErrDetectProp, &, <) constitutes a syntactic hierarchy (see further below) which can 
be used to simulate the properties in (4). In practice this means that simulates intersection 
between properties and ‘<’ simulates subset relationship between two properties such that the 
following desirable statements hold true, for any ErrDetectProp objects p, q 

p & p returns p 

p < q if and only if p & q returns p 

We note that the syntactic simulation of the properties in Eq. (4) is not complete (in fact it 
cannot be complete): for any ErrDetectProp objects p, q defined via transducers t and s with 
'pad g -pod (jQgg gQi; always hold that p < q. On the other hand, our implementation of the set 
of the five fixed properties constitutes a complete simulation of these properties, when the same 
alphabet is used. This implies, for instance, that 

pep & icp2 returns icp2 

where we have used the notation of Example 14. Our implementation associates to each object 
p of type ErrDetectProp a nonempty set p.ID of names. If p is a fixed property object, p.ID has 
one hardcoded name. If p is built from a transducer t, p.ID has one name, the name of t —this 
name is based on a string description of t. If p = q&r, then p.ID is the union of q.ID and r.ID 
minus any fixed property name N for which another fixed property name M exists in the union 
such that the M-property is contained in the Wproperty. 

Next we give a mathematical definition of what it means to simulate a set of code properties 
Q = {Qj \ j £ J} via a syntactic hierarchy (G, &, <)—see definition below—which can ultimately 
be implemented (as is the case here) in a standard programming language. The idea is that each 
g £ G represents a property [ 5 ] = Qj, for some index j, and G is the set of generators of the 
semigroup ((G), &) whose operation simulates the process of combining properties in Q, that 
is [cc&y] = [a;] n [y], and the partial order ‘<’ simulates subset relation between properties, that is 
X < y implies [x] C [y], for all x,y £ (G). 

The first result is that there is an efficient simulation of the set in Eq. (4) —see Theorem 19. 
The second result is that there can be no complete simulation of that set of properties, that is, a 
simulation such that [a:] C [y] implies x <y, for all x,y £ (G)—see Theorem 20. 

Definition 16. A syntactic hierarchy is a triple (G, &, <) such that G is a nonempty set and 
1. ((G), &) is the commutative semigroup generated by G with computable operation 
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2- ((G), <) is a decidable partial order (reflexive, transitive, antisymmetric). 

3. For all x,y G (G), x < y implies xSzy = x. 

4. For all x,y & (G), xhy < x. 

Next we list a few properties of the operation and the order 

Lemma 17. The following statements hold true, for all x,y,z G (G), 

1. X < X and xSzx = x 

2. X < y if and only if x = ySzz for some z G (G). 

3. X = xlky if and only if x < y. 

4- If X < y and x < z then x < ySzz. 

5. If X = (/i& • • • SzQn, for some gi,... ,gn G G, with all gi’s distinet and n> 2, then x < gi or 

X < g 2 , and hence x is not maximal. 

6. X is maximal if and only if x is prime (meaning, x = utkv implies x = u = v). 

Proof. The proof of correctness is based on the previous definition and uses standard logical 
arguments. We present only proofs for the second and fourth statements. 

The ‘if’ part of the second statement follows from the fourth statement of the above definition, 
and the ‘only if’ part follows from the third statement of the above definition. 

For the fourth statement, using the fact that a;&(y&z) < ySzz, it is sufficient to show that 

X = xSz{yhz). This follows when we note that x < y implies x = xhy and x < z implies 

X = x&z. □ 


Definition 18. Let Q = {Qj | j € J} be a set of properties, for some index set J. A (syntactic) 
simulation of Q is a quintuple (G, &,<,[], i^) such that (G, &, <) is a syntactic hierarchy and 

1. [] is a surjective mapping of (G) onto Q; 

2. for all x,y G (G), x <y implies [x] C [j/]; 

3. for all x,y G (G), [xhy] = [x] n [y]; 

4. (/? is a computable function of J into (G) such that [<p{j)] = Qj. 

The simulation is called complete if, for all x, y 

[a;] C [y] implies x < y. 

The simulation is called linear if J has a size function | • | and (G) has a size function || • || such 
that ||</5(j)|| = 0(|jD, for all j € J, and for all x,y 

\\xty\\ = 0(||a:|| + ||y||). 

By a size function on a set X, we mean any function / of A into Nq. 

Theorem 19. There is a linear simulation of the set 

{.pod I ^ input-preserving transducer}. 
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Proof. Let T be the set consisting of all finite sets of transducers. Let Ti,T 2 G T. We define 
G = {{t} I t is an input-preserving transducer}. 

T1&T2 = Ti U T2. 

Ti < T 2 , if T 2 C Ti. 

The above definitions imply that (G) consists of all T, where T is a finite nonempty set of input¬ 
preserving transducers, and that indeed ((G), &) is a commutative semigroup and ((G), <) is a 
partial order. Moreover one verifies that the last two requirements of Definition 16 are satisfied. 
Thus (G, &, <) is a syntactic hierarchy. 

Next we define the size function. For a finite nonempty set T = {ti,... ,t„} of transducers, we 
denote with VT the transducer ti V • • • V t„ of size 0(^}' |ti|) realizing R(ti) U • • • U R(t„). Then, 
define ||r|| = | V T|. One verifies that IIT 1 &T 2 II = 0(||Ti|j -|- IIT 2 II) as required. 

Next we use the syntactic hierarchy (G, &, <) to define the required simulation. First, define 
ip{t) = {t}, for any input-preserving transducer t. Then, define 

[r] = Vy'ip, which equals P| 

tsT 

One verifies that the requirements of Definition 18 are satisfied. □ 

The next result shows that there can be no complete simulation of the set of error-detecting 
properties. This follows from the undecidability of the Post Correspondence problem using meth¬ 
ods from establishing the undecidability of basic transducer related problems [4] . 

Theorem 20. There is no complete simulation of the set of properties 

\ t is an input-preserving transducer}. 

Before we present the proof, we establish a few necessary auxiliary results. 

Lemma 21. For any input-preserving transducers t, s we have 

-pod g -pad g t“^). 

Proof. First note that Eq. (2) is equivalent to 

{u),v) G R(t) implies w = v, for all v,w G L. 

This implies that, for all input-preserving transducers t,s, we have and 

R(s) C R(t) implies C pf, (5) 

'Pf = ( 6 ) 

The statement of the lemma follows from the above observations using standard set theoretic 
arguments. □ 

Lemma 22. The following problem is undecidable. 

Input: input-preserving transducers t,s 
Return: whether R(s V s“^) C R(t V t“^) 

Proof. We reduce the Post Correspondence Problem (PCP) to the given problem. Consider any 
instance ((ui)i, (vi)^) of PCP which is a pair of sequences of p nonempty words over some alphabet 
R, for some positive integer p. As mentioned before, we use tools that have been used in showing 
the undecidability of basic transducer problems [4]. In particular, we have that the given instance 
is a YES instance if and only if C V'^ 0, where 

U = {{ab\ui) \ i = l,...,p} and V = {(ab\vi) | j = 1,... ,p}, 
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and we make no assumption about the intersection of the alphabets B and {a, &}. Here we define 
the following objects 


G 

= {a 6 , a 6 ^,..., 06 ^} 

(7) 

diag(L) 

= {(x,x) X S L}, for any language L 

(8) 

D 

= diag(G+) U diag(aaR+) 

(9) 

X 

= (e, aa)U~^ U D 

(10) 

Y 

= ((G"*" X aaB~^) — (e, aa)V~^) U D 

(11) 

t 

= any transducer realizing X 

(12) 

s 

= any transducer realizing Y 

(13) 


We use rational relation theory to show the following claims, which establish the required reduction 
from PCP to the given problem. 

Cl: X and Y are rational relations 
C2: t and s are input preserving 

C3: t/+ n P+ 0 if and only if {X U X-^) C (F U Y-^) 

Claim Cl: First note that, as C’*" and aaB^ are regular languages, we have that £> is a rational 
relation. In [4], the author shows that U~^ and x B'^) — P+) are rational relations. It follows 
then that X is a rational relation. Similarly, rational is also the relation 

(e,aa)((C'+ x B+) - V+) = {{C+ x aaB+) - (e, aa)V+), 

which implies that Y is rational as well. 

Claim C2: From the previous claim there are transducers (in fact effectively constructible) t 
and s realizing X and Y, respectively. Note that the domains of both transducers are equal to 
C“''Uaai?+. The fact that (cc, x) G R(t)nR(s) for all x S C~'~UaaB^ implies that both transducers 
are indeed input-preserving. 

Claim C3: First it is easy to confirm that X~^ C Y~^ if and only if X C F (in fact for any 
relations X and F). Also the fact that C'*' naaB~^ = 0 implies that (e, aa)U^ is disjoint from the 
sets 

((e,aa)t/+)-\ D, ((C+ x aaB+) - {e,aa)V+)-^ 

and similarly ((C'"*' x aaB~^) — {e,aa)V~^) is disjoint from the same sets. The above observations 
imply that 

{X U X~^) C (F U Y~^) if and only if 

(e, aa)U~^ C ((<7+ x aaB^) — (e, aa)F+) if and only if 

(e, aa)U^ n ({a, b}* x B*) — ((<7+ x aaR+) — (e, aa)F+) = 0 if and only if 

(e, aa)U~^ (7 (e, aa)F+ = 0 if and only if 

17+ n F+ = 0, as required. □ 

Proof, (of Theorem 20.) For the sake of contradiction, assume there is a complete simulation 
(G, &,<,[], (/j) of the given set of properties. Then, for all x,y G (G), we have 

[a^] 7 [y] implies x <y. 

We derive a contradiction by showing that the problem in Lemma 22 is decidable as follows. 

1. Let X = 75 ( 1 ) 

2. Let y = ip{s) 

3. if y < x: return YES 

4. else: return NO 
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The correctness of the ‘if’ clause is established as follows: y < x implies [y] C [a;], which implies 
pcd (2 then R(s V s“^) C R(t V t“^), as required. The correctness of the ‘else’ clause is 

established as follows: first note y ^ x. We need to show R(s V s“^) % R(t V t“^). For the sake 
of contradiction, assume the opposite. Then, T’l'^ C , which implies [y] C [a;], and then (by 
completeness) y < x, which is a contradiction. □ 

6 Methods of Code Property Objects 

In the context of the research on code properties, we consider the following three algorithmic 
problems as fundamental. 

Satisfaction problem. Given the description of a code property and the description of a lan¬ 
guage, decide whether the language satisfies the property. In the witness version of this 
problem, a negative answer is also accompanied by an appropriate set of words showing how 
the property is violated. 

Maximality problem. Given the description of a code property and the description of a language 
L, decide whether the language is maximal with respect to the property. In fact we allow the 
more general problem, where the input includes also the description of a second language M 
and the question is whether there is no word w G M\L such that LUw satisfies the property. 
The default case is when M = T,*. In the witness version of this problem, a negative answer 
is also accompanied by any word w that can be added to the language L. 

Construction problem. Given the description of a code property and two positive integers n 
and £, construct a language that satisfies the property and contains n words of length £ (if 
possible). 

It is assumed that the code property can be implemented as p via a transducer t (whether input- 
altering or input-preserving) and, in the first two problems, the language is given via an NFA a. 
In the maximality problem, the second language M is given via an NFA b. In fact one can give a 
language via a regular expression, in which case it is converted to an automaton. Next we present 
the implementation of methods for the satisfaction and maximality problems. We discuss briefly 
the construction problem in the last section of the paper. 

Methods p. satisfies?(a) 

The satisfaction problem is decidable in time 0(|t||ap), if the property p is of the input-altering 
transducer type. This follows from Eq. (1) and can be implemented as follows 

c = t.runDnNFA(a) 

return (a & c).emptyP() 

For the case of p being the error-detecting property, the transducer t is input-preserving and 
Eq. (2) is decided via a transducer functionality test [16]. In EAdo this test can be implemented 
as follows, where the method functionalPO returns whether the transducer is functional. 

s = t.inlntersection(a) 

return s.outlntersection(a).functionalPO 

For the case of p being the error-correcting property, again the given transducer is input-preserving 
and Eq. (3) is decided via a transducer functionality test [16]. In FAdo this test can be implemented 
as follows 

s = t.inverse!) 

return s.outlntersection(a).functionalPO 
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Method p.maximalP(a, b) 

The maximality problem is decidable but PSPACE-hard [9]. In particular, for the case of both 
input-altering transducer and error-detecting properties, the decision algorithm is very simple: the 
language L(a) is p-maximal (within L(b)) if and only if 

L(b) \ (L(a) U t(a) U t“^(a)) = 0. (14) 

The above emptiness test is implemented in the method p.maximalP(a, b), which returns True 
or False, and uses standard NFA methods as well as the transducer methods t. inverse () and 
t.runOnNFA(a). For the case of error-correcting properties, our implementation makes use of 
Remark 12. 

Methods with witnesses: p.notMaximalWCa, b) 

It can be shown that any word belonging to the set shown in Eq. (14) can be added to L(a) and 
the resulting language will still satisfy the property [9]. This word can serve as a witness of the 
non-maximality of L(a). If no such word exists, the method returns None. 

Methods with witnesses: p.notSatisfiesW(a) 

For input-altering transducer and error-detecting properties, the witness version of the method 
p. satisf iesP (a) returns either a pair of different words u,v G L(a) violating the property, that 
is, V € t(u) or u € t(v), or they return the pair (None, None). In the former case, the pair (u, v) is 
called a witness of the non-satisfaction of p by the language L(a). For error-correcting properties 
p, a witness of non-satisfaction by L(a) is a triple of words (z, u, v) such that u,v G L(a) and u v 
and z G t(u) H t(v). Next we discuss how to accomplish this by changing the implementations of 
p. satisf iesP (a) shown before. 

Case 1: For input-altering transducer properties, we have the Python code 

return t.inlntersection(a).outlntersection(a).nonEmptyWO 

Recall from Section 3, the above returns (if possible) a witness for the nonemptiness of the 
transducer t when the input and output parts of t are intersected by the language L(a). 
This witness corresponds to the emptiness test in Eq. (1), as required. 

Case 2: For error-detecting properties, the defining transducer is a channel (input-preserving) 
and, therefore, we use the method nonFunctionalWO instead of nonEmptyWO. More specif¬ 
ically, we use the code 

u, V, w = t.inlntersection(a).outlntersection(a).nonFunctionalWO 
if u == v: 

return u, w 
else: 

return u, v 

If the method nonFunctionalWO returns a triple of words (u,v,w) then, by Proposition 9 
and the definitions of the in/out intersection methods, we have that v w, v G t{u), 
w G t{u) and all three words are in L(a). This implies that at least one ot u v and 
u w must be true and, therefore, the returned value is the pair (u,v) or (u,w). Moreover, 
the returned pair indeed violates the property. Conversely, if the non-functionality method 
returns a triple of Nones then the constructed transducer is not functional. Then L(a) must 
satisfy the property, otherwise any different words v,w G L(a) violating the property could 
be used to make the triple {v,v,w), or (w,w,v), which would serve as a witness of the 
non-functionality of the transducer. 
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Case 3: For error-correcting properties, we use again the non-functionality witness method. 

return t.inverseO.outlntersection(a).nonFunctionalWO 

For the correctness of this algorithm, one argues similarly as in the previous case. 

The above discussion establishes the following consequence of Proposition 9 and the definitions of 
product constructions in Section 3. 

Proposition 23. The algorithms implemented in the three forms (input-altering transducer, error¬ 
detecting, error-correcting) of the method p.notSatisf iesW(a) correctly returns a witness of the 
non-satisfaction of the property p by the language L(a). 

Example 24. The following Python interaction shows that the language a*b is a prefix and 
1-error-detecting code. Recall from previous examples that the Python strings st, si contain, 
respectively, the descriptions of an NFA accepting a*b, and a channel transducer that performs 
up to one substitution error when given an input word. 

>>> a = fio.readDneFromString(st) 

>>> pep = codes.buildPrefixProperty({’a’,’b’}) 

>>> sldp = codes.buildErrDeteetPropS(si) 

>>> p2 = pep & sldp 
>>> p2.notSatisfiesW(a) 

(None, None) 

□ 

7 Uniquely Decodable/Decipherable Codes 

The property of unique decodability or decipherability, UD code property for short, is probably the 
first historically property of interest in coding theory from the points of view of both information 
theory [29] as well as formal language theory [23,30]. A language L is said to be a UD code if, for 
any two sequences (ui)" and (uj)™ of L-words such that Ui ■ • • Un = Vi • • ■ Vm, we have that n = m 
and the two sequences are identical. In simpler terms, every word in L* can be decomposed in 
exactly one way as a sequence of L-words. In this section we describe our implementation of the 
satisfaction and maximality methods for the UD code property, as the corresponding methods for 
error-detecting properties discussed above are not applicable to the UD code property. 

Remark 25. In [15], as a consequence of a more general result, it is shown that the UD code 
property is not an n-independence for any n < Hq. Thus, this property is not an error-detecting 
property, so it is not describable by any input-preserving transducer. A specific argument showing 
that this property is not a 3-independence is as follows: the language L = {a,ab,ba} is not a 
UD code, as a{ba) = {ab)a, but every subset of L having < 3 elements is a UD code. 

We assume that a is an NFA object without e-transitions. One can create the UD code 
property using the following syntax 

p = codes.buildUDCodeProperty(a.Sigma) 

The method p.notSatisfiesW(a) 

The satisfaction problem for this property was discussed first in the well known paper [29] , where 
the authors produce a necessary and sufficient condition for a finite language to be a UD code—we 
note that some feel that that condition does not clearly give an algorithm, as for instance the papers 
[18,20]. Over the years people have established faster algorithms and generalized the problem to 
the case where the language in question is regular. To our knowledge, the asymptotically most 
efficient algorithms for regular languages are given in [12,22], and they are both of quadratic time 
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complexity. Our implementation follows the algorithm in [12], as that approach makes use of the 
transducer functionality algorithm. As before we enhance that algorithm to produce a witness of 
non-satisfaction which, given an NFA object a, it either returns the pair (None, None) if L(a) is a 
UD code, or a pair of two different lists of L(a)-words such that the concatenation of the words 
in each list produces the same word (if L(a) is not a UD code). 

We now describe the algorithm in [12] modified appropriately to return a witness of non-satisfaction. 
Again, the heart of the algorithm is the transducer functionality method. Let a = {Q, 
be the given NFA (with no e-transitions). 

1. If any of the initial states is final, then (as e is in the language) return ([e], [e, e]). 

2. Construct the transducer t = (Q,I],{0,1},T',/,/) in which the transition set T' is defined 
by the following process: If {p, a,q) £T then (p, cr/0, q) S T' and, if in addition q £ F then 
also {p,a/l,i) G T', for every i G L Note that the domain of the transducer is exactly the 
language L(a)*. 

3. Let w, X, y = t.nonFunctionalW(a) 

4. If any of w, x, y is None then return (None, None) 

5. At this point, we know that w G L(a), x and y are different and each one is of the form 

Moreover, each of x and y corresponds to a decomposition of w in terms of L(a)-words. More 
specifically, the binary word • • • O’’’*! encodes a sequence of words (wi)i such that their 
concatenation is equal to w and each Wi is the infix of w that starts at position Si and ends 
at position Si -\- r^, where Si = 0 and = \wi\ — I and -I- -I- 1. For example, if 

w = ababab and x = 010001, then the decomposition is {ab,abab). The algorithm returns 
the two lists of words corresponding to x and y. 

Example 26. The following Python interaction produces a witness of the non-satisfaction of the 
UD code property by the language {a&, abba, bab}. 

»> L = fl.FL(['abU ’abba’, ’bab’]) 

>>> a = L.toNFAO 

>>> p = codes.buildUDCodeProp(a.Sigma) 

>>> p.notSatisfiesW(a) 

([’ab’, ’bab’, ’abba’, ’bab’], [’abba’, ’bab’, ’bab’, ’ab’]) 

The two word lists are different, but the concatenation of the words in each list produces the same 
word. □ 

The method p.maximal?(a) 

This method is based on the fundamental theorem of Schutzenberger [5] that a regular language L 
is a UD code if and only if the set of all infixes of L* is equal to S*. Using the tools implemented 
in FAdo this test can be performed as follows. 

t = inf ixTrsinsducer (a. Sigma, True) 
b = a.starO 

return (~(t.runOnNFA(b))).empty?() 

The first statement above returns a transducer t that, on input w, outputs any infix of w. The 
second statement returns an NFA accepting L(a)*, and the last statement returns whether there 
is no word in the complement of all infixes of L(a)*. 
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8 LaSer and Program Generation 

The first version of LaSer [9] was a self-contained set of automaton and transducer methods 
as well as a set of Python and HTML documents with the following functionality: a client uploads 
a file containing an automaton in Grail format and a file containing either a trajectory automaton, 
or an input altering-transducer, and LaSer would respond with an answer to the witness version 
of the satisfaction problem for input-altering transducer properties. The new version, which we 
discuss here, is based on the FAdo set of automaton and transducer methods and allows clients 
to request a response about the witness versions of the satisfaction and maximality problems for 
input-altering transducer, error-detecting and error-correcting properties. 

We call the above type of functionality, where LaSer computes and returns the answer, the online 
service of LaSer. Another feature of the new version of LaSer, which we believe to be original in 
the community of software on automata and formal languages, is the program generation service. 
This is the capability to generate a self-contained Python program that can be downloaded on the 
client’s machine and executed on that machine returning thus the desired answer. This feature is 
useful as the execution of certain algorithms, even of polynomial time complexity—see the error- 
detection satisfaction problem-, can be quite time consuming for a server software like LaSer. 
The user interface of LaSer is very simple—see Fig. 5. The user provides the file containing the 


I-LaSer 

Independent (formal) Language Server — About 


1-LaSer answers questions about regular languages and independence properties—prefix code property, suffix code 
property,..., error-detection and error-correction properties. 

• Satisfaction question: Given language L and property P, does L satisfy P? 

• Maximality question: Given language L and property P, is L maximal with respect to P? 

Note: in general, this is PSPACE-hard. 

• To Do: Construction question: Given property P and integers N,k>0, return a language of N words of length k 
satisfying P. 

(See Technical Notes below) 


Provide a language (via an automaton): 


Decide satisfaction or maximality? [-Please Select- v| 


1 Submit Request | 


Generate Program 


Clear | 


Format for Automaton Format for Transducer Format for Trajectory Set 


SOME TECHNICAL NOTES 

Figure 5: The main user interface of LaSer 


automaton whose language will be tested for a question (satisfaction or maximality) about some 
property. When the decision question is chosen, then the user is asked to enter the type of property, 
which can be one of fixed, trajectory, input-altering transducer, error-detecting, error-correcting. 
Then, the user either clicks on the button “Submit Request” or on the button “Generate Program”. 

Next we present some parts of the program generation module. The program to be generated will 
contain code to answer one of the satisfaction or maximality questions for any of the properties 
regular trajectory (TRAJECT), input-altering transducer (INALT), error-detecting (ERRDET), 
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error-correcting (ERRCORR), or any of the fixed properties. First a Python dictionary is prepared 
to enable easy reference to the property type to be processed, and another dictionary for the 
question to be answered, as follows 

buildName = {"CODE": ("buildUDCodeProperty", ["a.Sigma"], 1), 

"ERRCORR": ("buildErrorCorrectPropS", ["t"], 1), 

"ERRDET": ("buildErrorDetectPropS", ["t"], 1), 

"HYPER": ("buildHypercodeProperty", ["a.Sigma"], 1), 

"INALT": ("buildlATPropS", ["t"], 1), 

"INFIX": ("buildlnfixProperty", ["a.Sigma"], 1), 

"OUTFIX": ("buildOutfixProperty", ["a.Sigma"], 1), 

"PREFIX": ("buildPrefixProperty", ["a.Sigma"], 1), 

"SUFFIX": ("buildSuffixProperty", ["a.Sigma"], 1), 

"TRAJECT": ("buildTrajPropS", ["$strexp", "$sigma"], 1) 

> 


tests = {"MAXP" 
"MAXW" 
"SATP" 
"SATW" 


"maximalP", 
"notMaximalW", 
"satisfiesP", 
"notSatisfiesW"}. 


Next we show the actual program generation function which takes as parameters the property 
name (pname), which must be one appearing in the dictionary buildNemie, the question to answer 
(test), the file name for the automaton (aname), the possible trajectory regular expression string 
and alphabet (strexp and sigma), and the possible file name for the transducer (tname). The 
function returns a list of strings that constitute the lines of the desired Python program—we have 
omitted here the initial lines that contain the commands to import the required FAdo modules. 


01 def program!(pname, test=None, aname=None, strexp=None, sigma=None, 
tnamie=None) : 

02 def expand(s): 

03 si = Template(s) 

04 return si.substitute(strexp=strexp, sigma=sigma) 


05 

06 

07 

08 

09 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 


1 = listO 

l.appendC'ax = \"7,s\"\n" “/ base64.b64encode(ainame)) 
l.appendC'a = readDneFromString(base64.b64decode(ax))\n") 
if buildName[pname][2] == 1: 
if tname: 

l.appendC'tx = \"°/os\"\n" y, base64.b64encode(tname)) 
1.append("t = base64.b64decode(tx)\n") 
s = "p = " + buildName [pnamie] [0] + "(" 
for si in buildNamie [pname] [1] : 
if si == "$strexp": 

s += "t, " 
else: 

s += "7,3," 7o expand(sl) 
s = s[:-l] + ")\n" 

1.append(s) 

1. append ("print p.7.s(a)\n" 7. tests [test]) 
else: . 


We refer to the lines above as meta-lines, as they are used to generate lines of the desired Python 
program. Meta-line 06 above generates the first line of the program, which would read the au¬ 
tomaton file in a string ax that is encoded in binary to allow for safe transmission and reception 
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over different operating systems. The next meta-line generates the line that would create an NFA 
object from the decoded string ax. The if part of the if-else statement is the one we use in 
this paper—the else part is for other LaSer questions. If there is a transducer file then, as in the 
previous case of the automaton file, LaSer generates a line that would create an SFT object from 
the encoded file. Then, meta-lines 12-18 generate a line that would create the property p to be 
processed, using the appropriate build-property function. Finally, meta-line 20 generates the line 
that would print the result of invoking the desired method test of the property p. 


9 Concluding Remarks 

We have presented a simple to use set of methods and functions that allow one to define many 
natural code properties and obtain answers to the satisfaction and maximality problems with 
respect to these properties. This capability relies on our implementation of basic transducer 
methods, including our witness version of the non-functionality transducer method, in the FAdo 
set of Python packages. We have also produced a new version of LaSer that allows clients to 
inquire about error-detecting and -correcting properties, as well as to generate programs that can 
be executed and provide answers at the client site. 

There are a few important directions for future research. First, the existing implementation of 
transducer objects is not always efficient when it comes to describing code properties. For example, 
the transducer defined in Example 11 consists of 6 transitions. In general, if the alphabet is of size 
s, then that transducer would require s + s{s — 1) -I- s = -I- s transitions. However, a symbolic 

notation for transitions, say of the form 

0 @s @s 0 

0 @s 1 

1 Os @s 1 

is more compact and can possibly be used by modifying the appropriate transducer methods. In 
this hypothetical notation, Os represents any alphabet symbol and Os ’ represents any alphabet 
symbol other than the previous one. Of course the syntax of such transducer descriptions has to be 
defined carefully so as to be as expressive as possible and at the same time efficiently utilizable in 
transducer methods. We note that a similar idea for symbolic transducers is already investigated 
in [35]. 

Formal methods for defining code properties need to be better understood or new ones need to 
be developed with the aim of ultimately implementing these properties and answering efficiently 
the satisfaction problem. Moreover, these methods should be capable of allowing to express 
properties that cannot be expressed in the transducer methods considered here. In particular, 
all transducer properties in this work are independences with parameter n = 3, so they do not 
include, for instance, the comma-free code property. A language L is a comma-free code [31] if 

LLnE+LE+ =0. 

The formal method of [14] is quite expressive, using a certain type of first order formulae to 
describe properties. It could perhaps be further worked out in a way that some of these formulae 
can be mapped to finite-state machine objects that are, or can be, implemented in available formal 
language packages like FAdo. We also note that if the defining method is too expressive then even 
the satisfaction problem could become undecidable—see for example the method of multiple sets 
of trajectories in [8]. 

We consider the construction problem to be the Holy Grail of coding theory. We believe that as 
long as the satisfaction problem is efficiently decidable one can use an algorithmic approach to 
address the problem to some extent. For example, we already have implemented a randomized 
algorithm that starts with an initial small language L (in fact a singleton one) and then performs 
a loop in which it does the following task. It randomly picks words w of length I that are not in 
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L until LU w satisfies the property, in which case L becomes L U tc, or no such w is found after 
a MAX value of trials. Although this is a simple approach, we feel that it can lead to further 
developments with respect to the construction problem. 
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